Knowledge Discovery from the Web

نویسندگان

  • Maryam Hazman
  • Samhaa R. El-Beltagy
  • Ahmed A. Rafea
  • Salwa El-Gamal
چکیده

The World Wide Web is a rich resource of information and knowledge. Within this resource, finding relevant answers to some given question is often a time consuming activity for a user. In the presented work we construct a web mining technique that can extract information from the web and create knowledge from it. The extracted knowledge can be used to respond more intelligently to user requests within the diagnosis domain. Our system has three main phases namely: a categorization phase, an indexing phase, and search a phase. The categorization phase is concerned with extracting important words/phrases from web pages then generating the categories included in them. The indexing phase is concerned with indexing web page sections. While the search phase interacts with the user in order to find relevant answers to their questions. The system was tested using a training web pages set for the categorization phase. Work in the indexing and search phase is still in going.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Expert Discovery: A web mining approach

Expert discovery is a quest in search of finding an answer to a question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” Expert with domain knowledge in any field is crucial for consulting in industry, academia and scientific community. Aim of this study is to address the issues for expert-finding task in real-world community. Collabor...

متن کامل

Designing an Ontology for Knowledge Discovery in Iran’s Vaccine

Ontology is a requirement engineering product and the key to knowledge discovery. It includes the terminology to describe a set of facts, assumptions, and relations with which the detailed meanings of vocabularies among communities can be determined. This is a qualitative content analysis research. This study has made use of ontology for the first time to discover the knowledge of vaccine in Ir...

متن کامل

Exploring Relevance as Truth Criterion on the Web and Classifying Claims in Belief Levels

The Web has become the most important information source for most of us. Unfortunately, there is no guarantee for the correctness of information on the Web. Moreover, different websites often provide conflicting information on a subject. Several truth discovery methods have been proposed for various scenarios, and they have been successfully applied in diverse application domains. In this paper...

متن کامل

Integration of Semantic Web and Knowledge Discovery for Enhanced Information Retrieval

Knowledge management is a process which comprises knowledge discovery, knowledge collection , knowledge organization and knowledge process. Among these four process knowledge discovery is integrated with semantic web for enhanced information retrivel. Knowledge discovery is the process of automatically searching large volume of data for patterns that can be considered knowledge about the data. ...

متن کامل

Knowledge discovery – semantic web

Knowledge management is a process which comprises knowledge discovery, knowledge collection, knowledge organization and knowledge process. Among these four process knowledge discovery is integrated with semantic web for enhanced information retrivel. Knowledge discovery is the process of automatically searching large volume of data for patterns that can be considered knowledge about the data. T...

متن کامل

Integration of Semantic Web and Knowledge Discovery for Enhanced Information Retrieval

Knowledge management is a process which comprises knowledge discovery, knowledge collection, knowledge organization and knowledge process. Among these four process knowledge discovery is integrated with semantic web for enhanced information retrivel. Knowledge discovery is the process of automatically searching large volume of data for patterns that can be considered knowledge about the data. T...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005